Utilizing ARM Technique in Mining Textual Data
Abstract
Text mining, one major school of Knowledge Discovery in Data (KDD), extracts hidden patterns, rules, regularities and trends from textual, non-database data (e.g., text files and web documents). It differs from data mining, another well-known major school of KDD, in the structure of its input: the structure of the texts handled by text mining is implicit, whereas the traditional database data handled by data mining is relatively explicit and structured. With the emergence of various text preprocessing approaches, “rich” texts can be converted into useful structured or semi-structured data. Hence, it is arguably feasible to obtain knowledge from texts using traditional data mining techniques such as classification rule mining, clustering and Association Rule Mining (ARM). In this paper, we summarize the wide range of uses of the ARM technique in text mining, an area referred to as Text Association Rule Mining (TARM). Our work is presented with the aim of supporting future research in text mining.
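To make the idea of applying ARM to preprocessed text concrete, the following is a minimal sketch, not the method of this paper. It assumes each document is reduced to a set of keywords (word order is discarded) and only mines rules between pairs of keywords, which is simpler than a full Apriori-style search. The names `tokenize` and `mine_pair_rules`, the thresholds, the stopword list and the sample documents are all hypothetical.

```python
from itertools import combinations


def tokenize(text, stopwords):
    """Turn raw text into a set of lowercase keywords (order is discarded)."""
    words = (w.strip(".,;:!?()\"'").lower() for w in text.split())
    return {w for w in words if w and w not in stopwords}


def mine_pair_rules(docs, min_support, min_confidence, stopwords=frozenset()):
    """Return rules (x, y, support, confidence) meaning 'x => y'.

    support    = fraction of documents containing both x and y
    confidence = documents with both x and y / documents containing x
    Only single-keyword antecedents and consequents are considered
    (an assumption made to keep the sketch short).
    """
    transactions = [tokenize(d, stopwords) for d in docs]
    n = len(transactions)
    item_count = {}
    pair_count = {}
    for t in transactions:
        for w in t:
            item_count[w] = item_count.get(w, 0) + 1
        for a, b in combinations(sorted(t), 2):
            pair_count[(a, b)] = pair_count.get((a, b), 0) + 1

    rules = []
    for (a, b), count in pair_count.items():
        support = count / n
        if support < min_support:
            continue
        # Each frequent pair yields two candidate rules: a => b and b => a.
        for x, y in ((a, b), (b, a)):
            confidence = count / item_count[x]
            if confidence >= min_confidence:
                rules.append((x, y, support, confidence))
    return sorted(rules)


# Hypothetical toy corpus, for illustration only.
SAMPLE_DOCS = [
    "association rules from text",
    "text mining with association rules",
    "clustering text documents",
    "mining association rules",
]

for x, y, s, c in mine_pair_rules(SAMPLE_DOCS, 0.5, 0.9, {"from", "with"}):
    print(f"{x} => {y} (support={s:.2f}, confidence={c:.2f})")
```

The keyword-set representation is the design choice that lets a text collection be treated like a transaction database: each document plays the role of a transaction and each keyword the role of an item.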
Similar resources
CSCR001: Literature Survey
My PhD research focuses on Text Mining (TM), one major school in Knowledge Discovery in Data (KDD), and in particular the task of classification/categorization of documents using novel algorithms for the identification of hidden patterns within these documents. Two significant techniques of Data Mining (DM), another well-known major school in KDD, will be utilized to support the research: Assoc...
Using Supervised Clustering Technique to Classify Received Messages in 137 Call Center of Tehran City Council
Supervised clustering is a data mining technique that assigns a set of data to predefined classes by analyzing dataset attributes. It is considered as an important technique for information retrieval, management, and mining in information systems. Since customer satisfaction is the main goal of organizations in modern society, to meet the requirements, 137 call center of Tehran city council is ...
Mining Technique Using Association Rules Extraction
... automatically extracting association rules from collections of textual documents. The technique is called Extracting Association Rules from Text (EART). It depends on keyword features to discover association rules amongst the keywords labeling the documents. In this work, the EART system ignores the order in which the words occur, focusing instead on the words and their statistical distributions...
Distributed Higher Order Text Mining
The burgeoning amount of textual data in distributed sources combined with the obstacles involved in creating and maintaining central repositories motivates the need for effective distributed information extraction and mining techniques. Recently, as the need to mine patterns across distributed databases has grown, Distributed Association Rule Mining (D-ARM) algorithms have been developed. The...
Publication date: 2004